Task Space Tile Coding: In-Task and Cross-Task Generalization in Reinforcement Learning
نویسنده
چکیده
Exploiting the structure of a domain is an important prerequisite for being able to efficiently use reinforcement learning in larger state spaces. In this paper, we show how to benefit from the explicit representation of structural features in so-called structure space aspectualizable state spaces. We introduce task space tile coding as a mechanism to achieve generalization over states with identical structural properties. This leads to a significant improvement of learning performance. Policies learned with task space tile coding can also be applied to unknown environments sharing the same structure space and thus enable for a faster learning in new tasks.
منابع مشابه
Cycle Time Optimization of Processes Using an Entropy-Based Learning for Task Allocation
Cycle time optimization could be one of the great challenges in business process management. Although there is much research on this subject, task similarities have been paid little attention. In this paper, a new approach is proposed to optimize cycle time by minimizing entropy of work lists in resource allocation while keeping workloads balanced. The idea of the entropy of work lists comes fr...
متن کاملTile Coding Based on Hyperplane Tiles
In large and continuous state-action spaces reinforcement learning heavily relies on function approximation techniques. Tile coding is a well-known function approximator that has been successfully applied to many reinforcement learning tasks. In this paper we introduce the hyperplane tile coding, in which the usual tiles are replaced by parameterized hyperplanes that approximate the action-valu...
متن کاملProgress in learning 3 vs. 2 keepaway
Reinforcement learning has been successfully applied to several subtasks in the RoboCup simulated soccer domain. Keepaway is one such task. One notable success in the keepaway domain has been the application of SMDP Sarsa(λ) with tile-coding function approximation [9]. However, this success was achieved with the help of some significant task simplifications, including the delivery of complete, ...
متن کاملAssessment of manual material handling in a tile and ceramic factory using the National Institute for Occupational Safety and Health equation in 2016
Background: Manual handling, lifting, or carrying of material is responsible for non-fatal injuries among employees in industries. It is the second most prevalent reported risk factor in workplaces that can lead to potential manual handling accidents and longer-term musculoskeletal disorders (MSDs). The aim of this study was the evaluation of manual material handling using the American National...
متن کاملHierarchical Reinforcement Learning in Computer Games
Hierarchical reinforcement learning is an increasingly popular research field. In hierarchical reinforcement learning the complete learning task is decomposed into smaller subtasks that are combined in a hierarchical network. The subtasks can then be learned independently. A hierarchical decomposition can potentially facilitate state abstractions (i.e., bring forth a reduction in state space co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011